AIbase

# Multitask Speech Processing

Meralion AudioLLM Whisper SEA LION
Other
A speech-to-text large language model customized for Singapore's multilingual and multicultural environment, integrating the Whisper-large-v2 speech encoder and the SEA-LION V3 text decoder.
Text-to-Audio, Transformers
MERaLiON
2,828
12
Kotoba Whisper Bilingual V1.0
Apache-2.0
Kotoba-Whisper-Bilingual is a collection of models distilled from Whisper, designed for Japanese and English speech recognition and speech-to-text translation.
Speech Recognition, Transformers, Supports Multiple Languages
kotoba-tech
782
13
Fsmn Vad
Other
FunASR is a foundational toolkit dedicated to bridging academic research and industrial applications in speech recognition, supporting various functions such as speech recognition, voice activity detection, and punctuation restoration.
Speech Recognition
funasr
107
17
© 2025 AIbase